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ABSTRACT 

We have derived the angular correlation function of a sample of 2096 sources detected 
in the ROSAT All Sky Survey Bright Source Catalogue, in order to investigate the 
clustering properties of AGN in the local Universe. Our sample is constructed by 
rejecting all known stars, as well as extended X-ray sources. Areas with \b\ < 30° 
and declination 6 < —30° are also rejected due to the high or uncertain neutral 
hydrogen absorption. Cross-correlation of our sample with the Hamburg/RASS optical 
identification catalogue, suggests that the vast majority of our sources are indeed 
AGN. A 4.1cr correlation signal between 0° and 8° was detected with w{6 < 8°) = 
2.5±0.6 x 10~ 2 . Assuming a 2-point correlation function of the form w(6) — (6>/# )~ ' 8 , 
we find 9 = 0.062°. Deprojection on 3 dimensions, using the Limber's equation, yields 
a spatial correlation length of r ss 6.0±1.6 hr 1 Mpc. This is consistent with the AGN 
clustering results derived at higher redshifts in optical surveys and suggests a comoving 
model for the clustering evolution. 

Key words: surveys-galaxies:active-quasars:general-large scale structure of the 
Universe- X-rays:general 



1 INTRODUCTION 

Active Galactic Nuclei (AGN) can be detected at high red- 
shift due to their high luminosities and hence they can be 
used to place stringent constraints on the evolution of Large 
Scale Structure over a wide redshift range (see Hartwick et 
Schade 1989). Our knowledge on the AGN clustering prop- 
erties comes mainly from large optical UV excess (UVX) 
surveys for QSOs (Boyle et al. 1988, Fang & Mo 1993, 
Shanks & Boyle 1994, Croom & Shanks 1996, La Franca, An- 
dreani & Cristiani 1998). Croom & Shanks (1996) analysed 
the clustering properties of the Large Bright Quasar Survey 
(LBQS) and combined the results with that obtained from 
other QSO surveys including the Durham/AAT UVX sam- 
ple. They derived a clustering length of r = 5.4 ± 1.1 h^ 1 
Mpc at a mean redshift of 1.27. La Franca, Andreani & Cris- 
tiani (1998) derived comparable results (r = 6.2 ± 1.6 h 
Mpc) by investigating a sample of ~ 700 quasars in the 
redshift range 0.3 < z < 3.2. Comparison of these clus- 
tering results in different redshifts rather favours a comov- 
ing model for the evolution of clustering. According to this 
model the amplitude of the correlation function remains 
fixed with redshift in comoving coordinates as the galaxy 
pair expands together with the background mass distribu- 
tion. In contrast, in the stable model for clustering evolu- 



tion, AGN trace clumps of mass which have gravitationally 
collapsed in bound units and have therefore ceased to take 
part in the general expansion of the universe. However, most 
of the AGN samples used in the analyses above, come from 
pencil beam surveys and contain a large fraction of high red- 
shift AGN. Georgantopoulos & Shanks (1994) studied the 
correlation function of a sample of about 200 IRAS selected 
Seyfert galaxies. They detect a 2a clustering signal on scales 
less than 10 h~ x Mpc. Although their statistics are limited, 
their results are more consistent with a comoving model. It 
is evident that there is a pressing need for large AGN sam- 
ples in the local universe in order to place tight constraints 
on the clustering evolution in a broad redshift range. 

Analogous clustering studies in X-rays have been scarce 
but they are potentially interesting as they provide the op- 
portunity to derive information on how the X-ray selected 
AGN trace the underlying mass distribution. The first direct 
study (using redshift information) of the correlation function 
and clustering properties of X-ray selected AGN is that of 
Boyle & Mo (1993). They studied the local z < 0.2 AGN 
clustering properties using a sample from the Einstein Ex- 
tended Medium Sensitivity Survey (EMSS). They did not 
detect clustering at a statistically significant level (~ 1.5er). 
Vikhlinin & Forman (1995) analysed the angular clustering 
properties of a set of deep ROSAT observations. Their an- 
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gular correlation length o translates, using Limber's equa- 
tion (Peebles 1980), to a spatial correlation length r which 
is consistent with that derived in optical suveys. Finally, 
Carrera et al (1998) using a sample of 235 AGN from two 
different soft X-ray surveys, the ROSAT Deep Survey (Geor- 
gantopoulos et al., 1996) and the RIXOS survey (Mason et 
al. 2000) derived the first direct evidence (ie., in 3 dimen- 
sions) for clustering in an X-ray selected QSO sample. They 
derived a low spatial correlation length between 1.5 and 5.5 
hT x Mpc depending on the adopted model of clustering evo- 
lution. 

In this paper we present a study of the clustering prop- 
erties of soft X-ray selected AGN from the ROSAT All Sky 
Survey (RASS). The large number of AGN in this sample 
provides excellent statistics and hence the opportunity to 
put stringent constraints on the clustering properties in the 
local X-ray Univerce. Moreover, comparison with cluster- 
ing results at higher redshift in both the X-ray and optical 
regime could provide valuable information on the evolution 
of clustering in the universe. 
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Figure 1. AitofT projection of our final catalogue (2096 sources) 
in Galactic coordinates. The center corresponds to Galactic lon- 
gitude and latitude 1=0, b=0 while 1 increases to the left 



2 THE ROSAT SAMPLE 

The data used come from the ROSAT All-Sky Survey 
Bright Source Catalogue, RASSBSC, (Voges et al. 1999). 
The RASSBSC is a subsample of the brightest 18811 sources 
in the RASS. The RASS covers 98 per cent of the sky in 
the 0.1-2.4 keV energy range with 30 arcsecs angular res- 
olution. We excluded all sources with count rate less than 
O.lcts s^ 1 . This yields a catalogue which has a uniform count 
rate limit over the sky (Voges et al. 1999). We further ex- 
clude all sources between the strip (-30,30) degrees in Galac- 
tic latitude in order to minimize the effect of hydrogen ab- 
sorption. Although the majority of the RASS sources are 
AGN there is an appreciable contamination from stars and 
clusters of galaxies. Therefore in order to study the AGN 
cross-correlation function we first need to exclude all stars, 
groups and clusters of galaxies. In principle, one can utilise 
the Hamburg Optical Identification Catalogue (Bade et al. 
1998) which is an ongoing spectroscopic follow-up program 
of the RASS sources in the Northern sky, containing objec- 
tive prism spectra for several thousand sources. However, 
as the Hamburg catalogue contains optical identifications 
only for a small subsample of the RASSBSC, we choose 
to use a different approach to exclude the stellar and ex- 
tended objects from our sample. We have therefore cross 
correlated the RASSBSC with known stellar catalogues, 
such as the Smithsonian Astrophysical Observator y Stellar 
Catalogue ( rrttp://xcna.harvard.cdu /softwarc/sao[) and th e 
STScI Guide Star Catalogue ( |rttp!77www^gsss. stsci.edu ) . 
In addition we have excluded all known ROSAT White 
Dwarfs, Cataclysmic Variables, X-ray Binaries and hot (OB) 
stars. Details of these ROSA T catalogues are given in the 
GSFC HEASARC web page http://heasarc.gsfc.nasa.gov. 



The above cross-correlation resulted in the rejection of 6246 
stars. The next step is to remove all known clusters and 
groups of galaxies from our sample. Inspection of the Ham- 
burg catalogue shows that all identified clusters appear to 
be spatially extended in X-rays. Hence we further exclude 
all sources which have an extension flag value greater than 
40 (see Voges et al. for the definition of the extension flag 



value). Indeed the cross correlation of our sample with the 
Hamburg RASS catalogue in the common (w 8000 degrees) 
area shows no remaining clusters. After the application of 
the above corrections the vast majority of our sample, at 
least in the areas of the sky covered by the Hamburg cat- 
alogue, consists of AGN (86 per cent), while there is still 
some small stellar and galaxy contamination (about 10 and 
4 per cent respectively). 

Finally, we need to make corrections for the neutral 
hydrogen in the Galaxy. The 0.1-2.4 keV band is very sensi- 
tive to photoelectric absorption. Therefore the apparent sky 
density may change in different areas on the sky due to the 
changes in the hydrogen column density (Nh) in our Galaxy. 
The large scale variations of the Nh with Galactic latitude 
as well as the patchy absorption on smaller scales may intro- 
duce spurious signals in the correlation function. Therefore, 
we used the Leiden/Dwingeloo Atlas of Galactic Neutral 
Hydrogen, (Hartmann & Burton 1997) in order to remove 
the effects of the hydrogen absorption. As the above atlas 
contains Nh data only for declinations <5 > —30° we further 
exclude RASSBSC sources outside the missing regions. The 
resulting final catalogue contains 2096 sources over 4.9 sr. 
In Fig. ll we present the Aitoff projection of our sample in 
Galactic coordinates. The apparent large-scale density gra- 
dients, seen in Fig^, are due to patchy Galactic absorption. 
In Fig. ^ we present the sky density as a function of the 
Galactic column density Nh- The solid line represents the 
best fit to the data. This is given by log e N — a + {3Nh 
where N is the density of galaxies per steradian and Nh is 
the column density in units of 10 20 cm -2 . We find a = 6.7 
and (3 = -0.2. 



3 METHOD AND ANALYSIS 

We used the two point angular correlation function to mea- 
sure the clustering properties of our sample. The correlation 
function w(6) is defined in terms of the propability SP(0) 
of finding two objects at a separation 6 in two small solid 
angles SQi and 5^2- 
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Figure 2. The AGN surface density as a function of hydrogen 
absorption Nh- The errorbars represent Poisson uncertainties. 



sp = n (i + w(e))sn 1 sn 2 



(1) 



where n is the mean number density of this objects. For an 
unclustered population w(8)=0, while positive or negative 
values of w(8) indicate clustering or anticlustering respec- 
tively. We estimated the correlation function by comparing 
the distribution of our sample to that of a random sample. 
The random sample was constructed by generating angu- 
lar positions at random over the sky and then folding them 
through the Nh map. In particular, the points on the sky 
with the lowest Nh are assigned a probability 1 (always ac- 
cepted) while points on the sky with higher Nh are assigned 
lower probabilities, following the N vs. Nh fit of Fig. ^, and 
therefore have a lower chance of entering the random cata- 
logue. The correlation estimator factor that we use is 



w{0) = f{N id /N ir ) - 1 

while the error is given by (Peebles 1973) 

o w = y/f(l + w(0))/N dr 



(2) 



(3) 



Here Ndd is the number of data-data pairs and Nj, r is 
the number of data-random pairs for a given separation. We 
use a random sample 30 times larger than our observed cata- 
logue. Then the normalization factor is / = 2N d N r / 'Nd(Nd- 
1) the ratio of the total number of independent data-data 
pairs to the number of data-random pairs. In the equation 
above N d and N r are the number of data and random (after 
folding through the Nh distribution) points respectively. 

The derived correlation function is presented in Fig. 
U We find N dd = 26669 and N dr = 26004 hence detect- 
ing a significant clustering at the 4.1a confidence level (us- 
ing Poissonian statistics) between and 8 degrees with 
w(8 < 8°)=2.5±0.6 x 10 -2 . On larger scales the correlation 
function is consistent with an unclustered population. As- 
suming a 2-point angular correlation function of the form 
w(8) = (6>/6>o) 1-7 we find that y - 1 = 0.9 ± 0.15 and 
8 — 0.08 ± 0.03 degrees. If the value of 7 — 1 is fixed at 
0.8, which is the value found for optically selected quasars, 
we find 8 = (0.06 ± 0.02) degrees. Next, we performed a 
test in order to check whether our signal is diluted due to 
the small percentage of stellar contamination. We estimated 
the correlation function for all the identified AGN in the 



Hamburg/RASS Catalogue. There were 662 AGN with dec- 
lination above —30°. The estimated correlation function is 
wagn(8) = 3.7 ± 1.0 x 10~ 2 . This is consistent with the 
w(8) of our sample, within ~ 1.2cr, suggesting that the stel- 
lar contamination does not have a major effect on the de- 
rived correlation amplitude. We also note that our results do 
not suffer from the possible 'amplification bias'. Vikhlinin & 
Forman (1995) pointed out that when the best-fit angular 
correlation length is smaller than the FWHM of the ROSAT 
PSPC Point Spread Function, two or more sources can be 
detected as a single source. As a result the distribution of the 
confused sources may be biased with respect to the correla- 
tion function of the parent population. This 'amplification 
bias' effect works towards the amplification of the correla- 
tion function amplitude (cf. Kaiser 1984). However, in our 
case, confusion problems are very unlikely as the FWHM of 
the ROSAT PSPC (~ 30 arcsec) corresponds to a very small 
spatial separation (<100 kpc) at the typical redshift of the 
RASS AGN (z ~0.1-0.2). 

Next, we use Limber's equation which gives the relation 
between the spatial and angular correlation function in order 
to calculate the correlation length r in three dimensions. 
We have used a power-low shape for the spatial two point 
correlation function following Groth & Peebles (1977): 



(4) 



where r is the correlation length in comoving coordinates, 
the parameter 7 is fixed at the value of 1.8 and p is the 
parameter describing clustering evolution. If p = the clus- 
tering is constant in comoving coordinates (comoving clus- 
tering). If p = 1.2 then the clustering is constant in proper 
coordinates (stable clustering). The amplitude 8 in two di- 
mensions is related to the correlation length r a in three di- 
mensions through the equation (Peebles 1980): 
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where H 7 = r(§)r(2f±)/r(2). The parameter y is related 
to the distance and redshift through 



y 



t (fi - 2)(1 + ilz) 1/2 + 2-Q. + Q.Z 
' Q?{l + z) 



(6) 



Finally, F(y) = [l-j/ 2 (fi- 1)] 1/2 , while <j>(y) determines 
the fraction of sources observable at a given y (or z), i.e., 
those with observed fluxes greater than the detection thresh- 
old. The cosmological parameter fl enters through the y — z 
relation, the volume factor F{y) and the selection function 
4>(y)', here we use Q — 1. The selection function <f>(y) depends 
also on the evolution of the AGN luminosity function. Boyle 
et al. (1993) derived the AGN cosmological evolution in the 
form of pure luminosity evolution, with the characteristic lu- 
minosity L* varying with redshift as (1 + z) 3 . The selection 
function can be written as 



4>{y) 



$(L x ,z)dL 



(7) 



where the local X-ray luminosity function is given by a dou- 
ble power-law function (Boyle et al. 1993). 

The lower limit of integration L m i n corresponds to the 
luminosity that can be observed at redshift z with a flux 
limit / ff. Note that the flux limit changes over the sky 
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Figure 3. The two-point angular correlation function w(9) of our 
sample (2096 sources). 

although the count rate limit is uniform, ie., 0.1 cts s _1 . 
This is because the flux limit depends sensitively on the col- 
umn density. For the conversion of the count rate to flux, a 
spectrum of T = 2 was assumed, while the appropriate Nh 
values were taken from Hartmann & Burton (1997). The ef- 
fective RASSBSC flux limit is then the average flux limit 
over the areas covered by our sample; we find f c g ~ 1CF 12 
erg cm -2 s _1 . 

We derive a value of r = 6.7 ± 1.0 h~ x Mpc assum- 
ing comoving clustering evolution and freezing the param- 
eter 7 at the value of 1.8. Alternatively, if we repeat our 
calculations using 7 ~ 1.9, which we derived from our un- 
constrained w(9) fit, we find r a = 5.4 ± 0.9 h^ 1 Mpc. We 
consider the difference Sr ~ 1.3 h^ 1 Mpc between these 
two determinations of r as a further source of uncertainty 
and we quote as our best estimate of r their average value: 
r ~ 6.0 ± 1.6 h^ 1 Mpc. The quoted uncertainty is derived 
by adding in quadrature Sr and the formal fitting errors. 
Note that in the case of stable clustering and for 7 = 1.8 we 
obtain r a = 6.5 ± 1.0 h' 1 Mpc. 



4 DISCUSSION AND CONCLUSIONS 

Assuming a 7=1.8 power law model we detect a significant 
clustering at 4.1a confidence level between and 8 degrees 
with w(6 < 8°) = 2.5 ± 0.6 x 10~ 2 . Our analysis of w(0) 
implies r = 6.0 ± 1.6 hT 1 Mpc. Note that our result comes 
from scales larger than ~2 degrees which corresponds to ~ 
10 h^ 1 Mpc at a redshift of z ~ 0.1. Although the statistics 
of our sample are limited, the lack of signal at small scales 
is significant at a high confidence level. This could imply 
a change in the form of the AGN correlation function at 
small separations. Our result represents the first significant 
detection of clustering in the local X-ray universe. The Boyle 
& Mo (1993) analysis of a sample of 183 low redshift (2 < 
0.2) AGNs, from the EMSS survey, represented the first X- 
ray selected AGN clustering study in the local Universe. 
They detected a weak but not significant clustering signal 
on scales < 10 _1 h^ 1 Mpc. 

Comparison of our findings with previous results at high 
redshift can give important clues on the evolution of clus- 
tering in the universe. We first compare with the results of 
Carrera et al. (1998) who analysed the clustering properties 



of two ROSAT, soft X-ray selected AGN samples at higher 
redshifts, the RIXOS and the DRS samples containing about 
235 AGN in total. They detect a 2a clustering signal in 
the RIXOS sample (mean z ~ 0.8) while no clustering is 
detected in the DRS sample. A correlation length of <3.5 
h _1 Mpc was found in the case of comoving evolution, sig- 
nificantly lower than our results. However, as their derived 
correlation length, r , may depend upon r c (the distance up 
to which the number of predicted and observed pairs arc 
compared in order to derive r ), larger samples are needed 
at high redshifts in order to constrain r . Better statistics 
can be provided by using optical samples. Croom & Shanks 
(1996) used a sample of ~ 1700 UVX AGN. They obtain a 
comoving clustering length of r = 5.4 ± 1.1 h^ 1 comoving 
Mpc at a mean redshift of z = 1.3. Therefore, comparison 
with our clustering length at low redshifts provides strong 
evidence for the comoving model of clustering evolution. 

Our derived AGN clustering length is similar to that of 
local galaxies. Hence it appears that AGN randomly sam- 
ple the galaxy distribution and they do not trace the high 
peaks of the density field despite their low density and high 
luminosity (cf Efstathiou & Rees 1988). This result is corrob- 
orated by the analysis of the cross-correlation of low redshift 
EMSS AGN with APM galaxies (Smith, Boyle & Maddox 
1995). They find that the cross-correlation amplitude is simi- 
lar to the galaxy auto-correlation amplitude thus supporting 
the idea that AGN randomly sample the galaxy population. 
This is again consistent with imaging studies of the AGN 
environments (cf. Boyle & Couch 1993). These authors find 
no excess number of galaxies around radio-quiet AGN, again 
pointing out that galaxies and AGN have similar environ- 
ments. 

Our work has important implications for the origin of 
the X-ray background (XRB). Several studies constrained 
the contribution of AGN to the XRB by using the auto- 
correlation function of the XRB fluctuations (eg Carrera 
& Barcons 1992, Georgantopoulos et al. 1993, Soltan & 
Hasingcr 1994). These studies conclude that if AGN have 
a clustering length of 6 Mpc similar to opticaly selected 
AGN and they evolve according to the comoving model they 
can produce only about half of the XRB intensity. Our re- 
sults appear to maintain the validity of this constraint. 

In the near future, we anticipate a drastic improvement 
of our knowledge of AGN clustering, at least in X-ray wave- 
lenghts. The RASS identification programmes will be essen- 
tial in providing a large AGN sample in the local universe 
with redshift information. The new ABRIXAS mission will 
be invaluable in providing local AGN samples in the hard X- 
ray band which is not prone to Galactic absorption. Finally, 
the XMM serendipitous surveys are expected to provide tens 
of thousands high redshift AGN thus strengthening the clus- 
tering statistics at high redshift. 
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